On the Use of Spearman's Rho to Measure the Stability of Feature Rankings

Authors

  • Sarah Nogueira
  • Konstantinos Sechidis
  • Gavin Brown
Abstract

Producing stable feature rankings is critical in many areas, such as bioinformatics, where the robustness of a list of ranked genes is crucial to interpretation by a domain expert. In this paper, we study Spearman’s rho as a measure of stability under training-data perturbations, not merely as a heuristic: we prove that it is the natural measure of stability when using mean rank aggregation. We provide insights into the properties of this stability measure, which allow a useful interpretation of stability values (e.g., how close a stability value is to that of a purely random feature ranking process) and of concepts such as the expected value of a stability estimator.
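
To make the idea concrete, here is a minimal sketch of one common way to turn Spearman's rho into a stability score: rank the features on several perturbed versions of the training data, then average rho over all pairs of rankings. The function names `spearman_rho` and `ranking_stability` are hypothetical. The sketch assumes complete rankings without ties, and it is not claimed to be the exact estimator defined in the paper.

```python
from itertools import combinations
from statistics import mean


def spearman_rho(r1, r2):
    """Spearman's rho between two tie-free rankings of the same d features.

    Each ranking is a sequence where position k holds the rank (1..d)
    assigned to feature k.
    """
    d = len(r1)
    if d != len(r2) or d < 2:
        raise ValueError("rankings must have the same length, at least 2")
    squared_diffs = sum((a - b) ** 2 for a, b in zip(r1, r2))
    # Classical closed form; valid only when there are no tied ranks.
    return 1.0 - 6.0 * squared_diffs / (d * (d ** 2 - 1))


def ranking_stability(rankings):
    """Average pairwise Spearman's rho over M >= 2 rankings (assumed estimator)."""
    if len(rankings) < 2:
        raise ValueError("need at least two rankings")
    return mean(spearman_rho(a, b) for a, b in combinations(rankings, 2))


# Three identical rankings of four features: perfectly stable.
print(ranking_stability([[1, 2, 3, 4]] * 3))             # 1.0
# Two exactly reversed rankings: maximally unstable.
print(ranking_stability([[1, 2, 3, 4], [4, 3, 2, 1]]))   # -1.0
```

Under this sketch, a value of 1 means every perturbation produced the same ranking, while independent, uniformly random rankings give a value of 0 on average, which provides a reference point for reading intermediate values.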

Similar resources

Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine

Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods.  In filter methods, features subsets are selected due to some measu...

Adolescents' perceptions of social status: development and evaluation of a new indicator.

OBJECTIVE Eliminating health disparities, including those that are a result of socioeconomic status (SES), is one of the overarching goals of Healthy People 2010. This article reports on the development of a new, adolescent-specific measure of subjective social status (SSS) and on initial exploratory analyses of the relationship of SSS to adolescents' physical and psychological health. METHOD...

Mathematical Modeling of Cancer Cells and Chemotherapy Protocol Dealing Optimization Using Fuzzy Differential Equations And Lypunov Stability Criterion

Mathematical models can simulate the growth and proliferation of cells in the interaction with healthy cells, the immune system and measure the toxicity of drug and its effects on healthy tissue pay. One of the main goals of modeling the structure and growth of cancer cells is to find a control model suitable for administration among patients. In this study, a new mathematical model is designed...

A Geometric View of Similarity Measures in Data Mining

The main objective of data mining is to acquire information from a set of data for prospect applications using a measure. The concerning issue is that one often has to deal with large scale data. Several dimensionality reduction techniques like various feature extraction methods have been developed to resolve the issue. However, the geometric view of the applied measure, as an additional consid...

Rho Kinase Inhibitors as a Novel Treatment for Glaucoma and Ocular Hypertension

In an elegant example of bench-to-bedside research, a hypothesis that cells in the outflow pathway actively regulate conventional outflow resistance was proposed in the 1990s and systematically pursued, exposing novel cellular and molecular mechanisms of intraocular pressure (IOP) regulation. The critical discovery that pharmacologic manipulation of the cytoskeleton of outflow pathway cells dec...

Publication date: 2017